AITopics | generating training time adversarial data

Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

Neural Information Processing SystemsDec-25-2025, 02:30:54 GMT

In this work, we consider one challenging training time attack by modifying training data with bounded perturbation, hoping to manipulate the behavior (both targeted or non-targeted) of any corresponding trained classifier during test time when facing clean samples. To achieve this, we proposed to use an auto-encoder-like network to generate such adversarial perturbations on the training data together with one imaginary victim differentiable classifier. The perturbation generator will learn to update its weights so as to produce the most harmful noise, aiming to cause the lowest performance for the victim classifier during test time. This can be formulated into a non-linear equality constrained optimization problem. Unlike GANs, solving such problem is computationally challenging, we then proposed a simple yet effective procedure to decouple the alternating updates for the two networks for stability. By teaching the perturbation generator to hijacking the training trajectory of the victim classifier, the generator can thus learn to move against the victim classifier step by step. The method proposed in this paper can be easily extended to the label specific setting where the attacker can manipulate the predictions of the victim classifier according to some predefined rules rather than only making wrong predictions. Experiments on various datasets including CIFAR-10 and a reduced version of ImageNet confirmed the effectiveness of the proposed method and empirical results showed that, such bounded perturbations have good transferability across different types of victim classifiers.

classifier, generating training time adversarial data, victim classifier, (9 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.59)

Add feedback

Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

Ji Feng, Qi-Zhi Cai, Zhi-Hua Zhou

Neural Information Processing SystemsOct-2-2025, 07:23:01 GMT

Unlike GANs, solving such problem is computationally challenging, we then proposed a simple yet effective procedure to decouple the alternating updates for the two networks for stability.

artificial intelligence, classifier, machine learning, (19 more...)

Neural Information Processing Systems

Country: Asia > China (0.28)

Genre: Research Report > New Finding (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Reviews: Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

Neural Information Processing SystemsJan-22-2025, 03:09:37 GMT

Post Response Comment: I think the authors have addressed my initial concerns, therefore I maintain my initial stand and incline to accepting it. Originality The setting is new as far as my knowledge can tell. Previous work such as "Certified Defense for Data Poisoning Attacks" considers contaminated instance within a feasible set, but modifying each training point by a small amount for an offline learner is new to me. I saw a backdoor attack in reference ([5]), but it is not referred to in the main body. I think the difference between this attack and the backdoor attack is that this one doesn't require the backdoor pattern to activate during test-time.

auto-encoder, generating training time adversarial data, learning, (2 more...)

Neural Information Processing Systems

Industry: Information Technology > Security & Privacy (0.63)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)

Add feedback

Reviews: Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

Neural Information Processing SystemsJan-22-2025, 03:09:27 GMT

The paper proposes a novel algorithm to hijack the training process so the trained model performs very bad. This is an important topic and all the reviewers agreed that this paper should be accepted.

auto-encoder, generating training time adversarial data, learning

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)

Add feedback

Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

Neural Information Processing SystemsOct-9-2024, 15:15:51 GMT

In this work, we consider one challenging training time attack by modifying training data with bounded perturbation, hoping to manipulate the behavior (both targeted or non-targeted) of any corresponding trained classifier during test time when facing clean samples. To achieve this, we proposed to use an auto-encoder-like network to generate such adversarial perturbations on the training data together with one imaginary victim differentiable classifier. The perturbation generator will learn to update its weights so as to produce the most harmful noise, aiming to cause the lowest performance for the victim classifier during test time. This can be formulated into a non-linear equality constrained optimization problem. Unlike GANs, solving such problem is computationally challenging, we then proposed a simple yet effective procedure to decouple the alternating updates for the two networks for stability.

classifier, generating training time adversarial data, victim classifier, (7 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.64)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.61)

Add feedback

Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

Feng, Ji, Cai, Qi-Zhi, Zhou, Zhi-Hua

Neural Information Processing SystemsMar-19-2020, 01:31:55 GMT

In this work, we consider one challenging training time attack by modifying training data with bounded perturbation, hoping to manipulate the behavior (both targeted or non-targeted) of any corresponding trained classifier during test time when facing clean samples. To achieve this, we proposed to use an auto-encoder-like network to generate such adversarial perturbations on the training data together with one imaginary victim differentiable classifier. The perturbation generator will learn to update its weights so as to produce the most harmful noise, aiming to cause the lowest performance for the victim classifier during test time. This can be formulated into a non-linear equality constrained optimization problem. Unlike GANs, solving such problem is computationally challenging, we then proposed a simple yet effective procedure to decouple the alternating updates for the two networks for stability.

classifier, generating training time adversarial data, victim classifier, (7 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.64)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.61)

Add feedback

Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

Feng, Ji, Cai, Qi-Zhi, Zhou, Zhi-Hua

arXiv.org Machine LearningMay-22-2019

In this work, we consider one challenging training time attack by modifying training data with bounded perturbation, hoping to manipulate the behavior (both targeted or non-targeted) of any corresponding trained classifier during test time when facing clean samples. To achieve this, we proposed to use an auto-encoder-like network to generate the pertubation on the training data paired with one differentiable system acting as the imaginary victim classifier. The perturbation generator will learn to update its weights by watching the training procedure of the imaginary classifier in order to produce the most harmful and imperceivable noise which in turn will lead the lowest generalization power for the victim classifier. This can be formulated into a non-linear equality constrained optimization problem. Unlike GANs, solving such problem is computationally challenging, we then proposed a simple yet effective procedure to decouple the alternating updates for the two networks for stability. The method proposed in this paper can be easily extended to the label specific setting where the attacker can manipulate the predictions of the victim classifiers according to some predefined rules rather than only making wrong predictions. Experiments on various datasets including CIFAR-10 and a reduced version of ImageNet confirmed the effectiveness of the proposed method and empirical results showed that, such bounded perturbation have good transferability regardless of which classifier the victim is actually using on image data.

artificial intelligence, classifier, machine learning, (18 more...)

arXiv.org Machine Learning

1905.09027

Country: Asia > China > Jiangsu Province > Nanjing (0.04)

Genre: Research Report > New Finding (0.87)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Filters

Collaborating Authors

generating training time adversarial data

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

Reviews: Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

Reviews: Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder

Learning to Confuse: Generating Training Time Adversarial Data with Auto-Encoder